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APPARATUS FOR CONTROLLING MULTI-WORD STACK 
OPERATIONS IN DIGITAL DATA PROCESSORS 



BACKGROUND 



1. Technical Field; 

The present invention relates generally to micro-architecture design of 
digital data processors and, more particularly, to an apparatus for controlling a 
multibank stack storage device embedded in a digital data processor. 

2. Description of Related Art 

Many conventional programmable digital data processors, such as 
microcontroller units (MCUs) and digital signal processors (DSPs), use stack- 
based instruction sets. A data stack is a storage device that stores information in 
such a manner that the last stored information item is the first item retrieved, i.e., a 
stack is accessed in a Last-In, First-Out (LIFO) fashion. 

In general, two operations of a stack are the insertion and deletion of stack 
items. When a new item is inserted to the stack, it is said to be "pushed" onto the 
stack. Conversely, when an item is deleted from the stack, it is said to be 
"popped" off the stack. The stack may store program data such as operands or 
their results, or address pointers, subroutine parameters, and register contents for 
either subroutine calls and returns or interrupt acknowledges and returns. All 
"push" and "pop" operations are performed by utilizing the top of the stack. In 
other words, the top of stack (TOS) is the memory location that is normally read 
or written. The register that holds the address for the stack is called a stack 
pointer (SP). This pointer indicates the location of the top of the stack. 

In a system (e.g., a computer) having a digital data processor chip, a stack 
may exist as a stand-alone unit or may be implemented in a random access 
memory (RAM) unit attached to the processor chip. This stack is organized from 
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a software point of view, and is generally called a "software" stack (sometimes 
called a "memory" stack). Also, a stack can be organized as a collection of a finite 
number of registers embedded within a digital processor chip. This type of stack 
is called a "hardware" stack (sometimes called a "register stack"). 

The software stack is convenient to use because it allows a user to adjust 
the stack to a desired size. But, its structure is disadvantageous in view of 
operating speed and power dissipation since it is required to access external units. 
On the other hand, the embedded hardware stack structure is suitable for meeting 
the higher stack operation requirements, for example, a context switching (in 
which all the contents of several internal registers of the processor chip are saved 
on the stack for a short time). Also, the hardware stack requires low power 
consumption because there is no need for accessing any external unit such as a 
memory. Consequently, such stack-embedded processors have been used for 
small battery-powered systems having low power consumption requirements, such 
as mobile telephones. The drawback of the embedded stack is that users cannot 
change its size. 

Fig. 1 shows the organization of a conventional hardware stack. Referring 
to Fig. 1, the stack 10 has the storage capacity of 32 words. The stack pointer 
(SP) 20 contains a binary number whose value is equal to "the current stack top 
word address (A T0S ) + 1 ". For a stack that can store 32 words, the stack pointer 20 
should contain 5 bits since 2 5 = 32. A stack pointer 20 having five bits cannot 
exceed a number greater than 31 ('1 1 1 IT in binary). 

'-As-tJtes trat e d in Fig. 1, five items a re pla c e d in th o 35-werd-stack IOjJ^B ,^ 
C, D, and E in the order shown. With item E on top lofJh^tackTthTcontent of the 
stack pointer 20 is 5 (i.e., WlOVrnjometryyTfo^sert a new item, the stack 
storage lOJsJ^piisted^^rstoring the new item into the location indicated by the 
stack pointe r 20 (i.e., t h e l ocation at add re ss '5 ' ) and th en i nc reme n t ing the st a ck 
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)oint to the next - higher order location at address '6' 
removal of the top item E at address '4', the stack storggs4fl^"popped" by 
decrementing the content of theslack-ptf^^ first and then retrieving the top 
item E fromjhe4oc3^ address '4', so that the stack pointer 210 contains '4' to 
ficatc the top locatiofl r^ 

Typically, digital data processors are categorized in terms of the number of 
binary bits in the data they process, i.e. 3 their word length. For example, an 8-bit 
processor to process information by 8 bits (1 byte) has an 8-bit wide stack, hence 
only one word (i.e., 8 bits) is able to be pushed onto the stack at a time. 

A digital processor will generally be designed to have its address bit width 
larger than its data bit width in order to acquire an adequate amount of memory 
address space. For example, an 8-bit processor may have its address being 16 bits 
or more in size even though its data word is only 8-bit wide. In such a case, to 
push or pop a 16-bit address (e.g., for a subroutine call or return operation) onto 
or off an 8-bit stack, the 16-bit address should be first divided into two 8-bit 
portions, and then the two portions are pushed/popped on/off the stack in twice, 
which causes degradation in processor performance. Accordingly, a digital 
processor that provides improved performance of stack-based operations is 
desired. 



20 SUMMARY OF THE INVENTION 

The present invention is directed to an improved hardware stack for digital 
data processors. In one aspect of the present invention, a hardware stack 
comprises an instruction decoder generating a plurality of decoding signals, each 
of the plurality of decoding signals denoting one of a plurality of stack operations, 
25 a stack storage comprising a plurality of storage locations, each of the plurality of 
storage location being classified into one of at least two banks, and a stack 
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pointer circuit for pointing to at least one of the stack banks of the stack storage in 
response to at least one decoding signal to thereby cause a stack operation. 

In another aspect of the present invention, a digital data processor 
comprises a stack storage having a plurality of locations each accommodating a 
one-word item only, and a stack pointer circuit that points to one or more 
locations of the stack storage at a one time. The locations of the stack storage is 
classified into two or more banks. In addition, an instruction decoder is provided 
for decoding an instruction and generating a plurality of decoding signals which 
denote a one-word stack operation or multi-word stack operations. The stack 
pointer circuit points to one or more stack banks at a time in response to the 
decoding signals such that a one-word item or a multi-word item can be pushed 
into or popped from the stack banks at a time. 

According to yet another aspect of the present invention, a digital processor 
preferably comprises a stack storage having two banks, a stack pointer for the 
stack storage, and a stack storage control circuit which inserts and removes either 
a one-word item or a two-word item into and from the stack storage at a time on 
the basis of a content of the stack pointer. The two-word item is inserted into and 
removed from two adjacent locations of the stack storage at a time. The stack 
storage control circuit increases or decreases the content of the stack pointer by 
one when the decoding signals indicate a one-word stack operation. Additionally, 
the stack storage control circuit increases or decreases the content of the stack 
pointer by two when the decoding signals indicate a two-word stack operation. 

According to another aspect of the present invention, the digital data 
processor comprises a main stack pointer, first and second stack pointers, and a 
stack pointer control logic circuit. The main stack pointer, the first and second 
stack pointers are provided for pointing to one of the locations of the stack 
storage, to one of the locations of the first bank, and to one of the locations of the 
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second bank, respectively. The instruction decoder decodes a stack-based 
instruction and generates a plurality of decoding signals which denote one-word 
push, one- word pop, two- word push and two- word pop operations. The stack 
pointer control logic circuit controls the first and second bank stack pointers in 
response to the decoding signals such that a one-word item or a two-word item is 
inserted into and removed from the stack storage at a time on the basis of a 
content of the main stack pointer. The first and second banks preferably have the 
same size. One of the first and second banks includes the locations whose 
addresses have least significant bits of O's and the other of the first and second 
banks includes the locations whose addresses have least significant bits of l's. 

These and other aspects, features and advantages of the present invention 
will be described and become apparent from the following detailed description of 
preferred embodiments, which is to be read in connection with the accompanying 
drawings in which like reference symbols indicate the same or similar 
components. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic diagram illustrating a conventional hardware stack 
architecture for digital processors; 

Fig. 2 is a block diagram illustrating a hardware stack according to a 
preferred embodiment of the present invention; 

Fig. 3 is a high level block diagram illustrating a detailed circuit 
configuration of a stack pointer circuit according to an embodiment of the present 
invention; 

Figs. 4A through 5E are schematic views which illustrate methods of 
operation of the stack pointer circuit according to the present invention; 

Fig. 6 is a high level block diagram illustrating a stack pointer circuit 
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according to another embodiment of the present invention; and 

Figs. 7A and 7B are tables showing content variations of the stack pointers 
of Fig. 6 in accordance with stack instructions and content of the main stack 
pointer. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The present invention relates to an improved hardware stack for digital data 
processors. In order to provide a thorough understanding of the present invention, 
the common architecture of digital data processors will be explained before the 
description of the preferred embodiments. 

A digital data processor typically contains an arithmetic and logic unit 
(ALU), a set of registers (or register file), and a control unit (CU). These three 
sections are connected through internal buses. 

One function of the ALU is to perform arithmetic operations such as add, 
subtract, multiply and divide on operands presented to it by the CU. Another 
function of the ALU is to perform boolean operations, such as NOT, AND, OR, 
and EXclusive-OR. Other operations performed by the ALU include clear, 
increment, rotate and shift in both arithmetic and logic form. 

The register set includes both general- and special-purpose types, as is well 
known to those skilled in the art. One special register is the program counter (PC) 
which points to the memory location where the next sequential instruction is to be 
found. This register is automatically updated following the execution of the 
current instruction unless the current instruction is a branch or jump. Another 
special register is the instruction register (IR) which temporarily stores 
instructions that are fetched from memory. The stack pointer (SP) is a special 
register which holds the address of a stack unit for the temporary storage of 
variables. Additionally, an accumulator (ACC) and status register (SR) are used 
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for special purposes. The ACC holds data fetched from memory, passes this data 
onto the ALU for some calculation, receives results back from the ALU and then 
passes these results back to memory. The ALU operations affect flag bits in the 
SR. These bits can be subsequently used to test for conditional jumps, which are 
5 taken depending on whether a carry, overflow, negative or zero result was 
obtained. 

Typically, program execution in a digital data processor involves such steps 
as repeatedly fetching, decoding and executing these instructions, one at a time. 
An instruction fetch involves loading the IR with the contents of a particular 
10 memory location, followed by incrementing the PC to the next sequential location 
unless the current instruction is a branch or jump. Instructions fetched from 
>J memory are passed via the IR to the instruction decoder for decoding and 

subsequent execution. The instruction decoder determines what operation needs 
Gjl to be performed. The CU carries out the decoded instruction by selecting the 

SI 15 appropriate combination of individual control lines. 

¥ Preferred embodiments of the present invention will now be described in 

m detail with reference to the accompanying drawings. Reference is made first to 

^ Fig. 2, which illustrates an architecture of a hardware stack device 500 according 

*f to a preferred embodiment of the invention. The stack device 500 is intended to 

20 be incorporated within a programmable digital data processor, such as a 
microprocessor, a microcontroller unit, or a digital signal processor. 

Referring to Fig. 2, the stack device 500 is connected with a register set 
400 to provide a temporary storage for the register set 400. The stack device 500 
comprises a stack pointer circuit 200 and a stack storage 300. The stack storage 
25 300 is subdivided into two banks BANK0 and BANK1 . The first stack bank 
BANK0 comprises stack locations whose least significant bits (LSBs) of 
addresses are 0's and the second stack bank BANK1 comprises other locations 
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whose LSBs of addresses are Ts. The stack pointer circuit 200 comprises a main 
stack pointer (MSP) 210 of n+1 bits, two n bit-wide bank stack pointers (BSPs) 
230 and 240 for pointing to the tops of the stack banks BANK0 and BANK1 5 
respectively, and a stack pointer control logic circuit 220 for controlling the stack 
pointers 210, 230 and 240, where f n' is a positive integer. The main stack pointer 
(sometimes called a user stack pointer) 210 is designed to contain a binary number 
whose value equals "the top word address (A T0S ) +1". A stack storage control 
circuit, comprising the first and second bank stack pointers 230, 240 and the stack 
pointer control circuit 220, inserts and removes either a one-word item or a two- 
word item into and from the stack storage 300 at a time on the basis of a content 
of the main stack pointer 210 in response to a plurality of decoding signals from 
an instruction decoder 100. 

Fig. 3 is a high level block diagram illustrating a detailed circuit 
configuration of a stack pointer circuit 200 according to-and-embodiment of the 
present invention. Referring to Fig. 3, the stack pointer circuit 200 comprises a 
stack pointer control logic circuit 220a for controlling the stack pointers 210, 230 
and 240, which comprises an adder 221, first, second, and third multiplexers 
(MUXs) 222, 225 and 226 serving as selectors, an increment logic circuit 223, and 
first and second MUX control logic circuits 224 and 227. The multiplexers 222, 
225 and 226 are preferably 2x1 multiplexers, each having two inputs INI, IN2 
and an output OUT. The characters 'S' and *E f for each of the multiplexers 222, 
225 and 226 denote a select terminal and an enable terminal, respectively. 

The adder 221 adds '+1*, '+2', '-1', or '-2' to the content MSP<n:0> of the 
main stack pointer 210 in response to a first instruction decoding signal DCSA1 
from the instruction decoder 100, which indicates the types of stack-based 
instructions. Specifically, if the first instruction decoding signal DCSA1 indicates 
a one-word (i.e., 8-bit) push operation, the adder 221 adds f +l' and the content 
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MSP<n:0> of the stack pointer 210. If the first instruction decoding signal 
DCSA1 indicates a one-word pop operation, the adder 221 adds '-1' to the content 
MSP<n:0>. For a two-word (i.e., 16-bit) push operation, f +2' is added to the 
content MSP<n:0>; and for a two-word pop instruction, l -2' is added. 
5 The first multiplexer 222 selects one of the content MSP<n:0> of the main 

stack pointer 210 and the content ADD<n:0> of the adder 221 in response to a 
second instruction decoding signal DCSA2 provided from the instruction decoder 
100, which functions as a "select" control signal for the first multiplexer 222. 
Specifically, when the instruction decoder 100 indicates a push operation by 
10 generating a second instruction decoding signal DCSA2 of logic '0', the first 
multiplexer 222 selects the output MSP<n:0> of the main stack pointer 210. 
^ Thereafter, the main stack pointer 2 1 0 is updated by the resultant content 

^ ADD<n:0> of the adder 221 . On the other hand, when the instruction decoder 

W 

100 indicates a pop operation by generating an instruction decoding signal 
H 1 5 DCS A2 of logic ' 1 the first multiplexer 222 selects the output ADD<n:0> of the 
s adder 22 1 , and then the main stack pointer 2 1 0 is also updated by the resultant 

% value ADD<n:0> of the adder 22 1 . 

if! I 

=1 For the sole purpose of illustration, and by way of example with reference 

to Fig. 2, the stack storage 300 is assumed to have the storage capacity of 32 

20 words, with each of the stack banks BANK0 and BANK1 having the storage 

capacity of 16 words. With this example, the main stack pointer 210 is assumed 
to contain 5 bits of MSP<4:0> (since 2 5 = 32) and each of the bank stack pointers 
230 and 240 is assumed to be 4-bit wide (since 2 4 = 16). In addition, it is assumed 
that (1) the stack pointer circuit 200 of the invention is incorporated within an 8- 

25 bit processor intended to process 8-bit data (1 -byte-wide); (2) the stack storage 
300 is 8-bit wide; and (3) the 8-bit processor is loaded with 8- (single- or one- 
word) or 16-bit (double- or two-word) wide stack items. An example of the 16-bit 
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items (i.e., two-word items) includes an address stored in the program counter 
(PC). 

Of the five bits MUX1<4:0> output from the first multiplexer 222, the least 
significant bit MUXKO is applied to first and second MUX control logics 224 
and 227, and the remaining four high-order bits MUX1<4:1> output from the 
first multiplexer 222 are provided to an increment logic circuit 223. The 
MUX1<4:1> are also provided to first inputs INI of the respective second and 
third multiplexers 225 and 226. 

The increment logic circuit 223 increments the MUX1<4:1> (which is 
output from the first multiplexer 222) by 1. The content INC<3:0> output from 
the increment logic circuit 223 is provided to the second inputs IN2 of the 
respective multiplexers 225 and 226, each of which selects one of the MSP<4:1> 
from the first multiplexer 222 and the output INC<3:0> of the increment logic 
circuit 223 in response to the corresponding select control signal SA1 or SA2. 
The output MUX2<3:0> of the second multiplexer 225 is provided to the second 
bank stack pointer 240, while the output MUX3<3:0> of the third multiplexer 226 
is provided to the first bank stack pointer 230. 

The first MUX control logic circuit 224 receives the MUX1<0> from the 
first multiplexer 222 and a third instruction decoding signal DCSA3 from the 
instruction decoder 100, and generates a first MUX enable signal EA1 and a select 
control signal SA1 for the second multiplexer 225. The second MUX control 
logic circuit 227 receives the MUX1<0> from the first multiplexer 222 and a 
fourth instruction decoding signal DCSA4 from the instruction decoder 100, and 
generates a second MUX enable signal EA2 and a select control signal SA2for the 
third multiplexer 226. 

When the MUXKO is logic T and the decoding signals DCS A3 and 
DCSA 4 indicate a one-word operation, the first MUX control logic circuit 224 
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enables the second multiplexer 225, while the second MUX control logic circuit 
227 disables the third multiplexer 226. Conversely, when the MUX1<0> is logic 
'0 ? and the decoding signals DCSA3 and DCSA 4 indicate a one-word operation, 
the first MUX control logic circuit 224 disables the second multiplexer 225 while 
the second MUX control logic circuit 227 enables the third multiplexer 226. In 
short, during a one-word operation, either of the first and second MUX control 
logic circuits 224 and 227 generates the respective select control signals SA1 or 
SA2 of logic f 0 ! , so that the second or third multiplexer 225 and 226 selects the 
MUX1<4:1> from the first multiplexer 222. 

On the other hand, when the instruction decoding signals DCSA3 and 
DCSA4 indicate a two-word operation, the second and third multiplexers 225 and 
226 both are enabled by the first and second MUX control logic circuits 224 and 
227, respectively, regardless of the logic state of the MUX1<0>. 

Additionally, if the MUX1<0> is logic '0 1 during the two-word operation, 
then the first and second MUX control logic circuits 224 and 227 generate a select 
control signal SA1 of logic T and a select control signal SA2 of logic '0', 
respectively, so that the second multiplexer 225 selects the output INC<3:0> of 
the increment logic circuit 223 and the third multiplexer 226 selects the 
MUX1<4:1> from the first multiplexer 222. In contrast, if the MUX1<0> is logic 
T during the two-word operation, then the first and second MUX control logic 
circuits 224 and 227 generate a select control signal SA1 of logic ? 0 f and the select 
control signal SA2 of logic T, respectively, so that the second multiplexer 225 
selects the MUX1<4:1> from the first multiplexer 222 and the third multiplexer 
226 selects the output INC<3:0> of the increment logic circuit 223. 

Hereinafter, various modes of operation of the stack pointer circuit 200 of 
Fig. 3 will be explained with reference to Figs. 4A through 5E, wherein the 
numbers within the respective blocks (i.e., locations) of the two banks BANKO 
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and BANK1 represent the addresses of the stack storage 300; the numbers outside 
the blocks represent the bank addresses of the corresponding bank; the dashed 
blocks represent the locations having items; the bold box represents the result of a 
push operation; and the dashed box represents the result of a pop operation. 

Fig. 4A illustrates a situation where the main stack pointer 210 points to the 
location at an even address f 4' (= '00100 1 in binary) of the stack storage 300 5 i.e., a 
bank address '2' (= '0010' in binary) of the first bank BANK0. Figs. 4B through 
4E illustrate the results of a one-word push, a two-word push, a one-word pop, 
and a two-word pop operation, respectively, based on the example shown in Fig. 
4A. Each of these stack operations will now be described in further detail. 

To begin, referring now to Fig. 3 and Fig. 4B, during a one-word push 
(e.g., 8-bit push) operation, the stack pointer circuit 200 of Fig. 3 operates as 
follows. The adder 221 adds '+1 ? to the content of the main stack pointer 210 in 
response to the instruction decoding signal DCSA1 from the instruction decoder 
100. The first multiplexer 222 selects the content ('00100') of the main stack 
pointer 210 in response to the instruction decoding signal DCSA2 from 
instruction decoder 100. Since the MUXIO (output from the first multiplexer 
222) is logic '0', the second multiplexer 225 is disabled by the first MUX control 
logic circuit 224 in response to the instruction decoding signal DCS A3 from the 
instruction decoder 100, whereas the third multiplexer 226 is enabled by the 
second MUX control logic circuit 227 in response to the instruction decoding 
signal DCSA4 from the instruction decoder 100. The third multiplexer 226 
selects the MUX1<4:1> of '0010' (= 2) in response to the select control signal 
SA2 from the second MUX control logic circuit 227. As a result, the first bank 
stack pointer 230 contains a bank address of '2'. Accordingly, a new one-word 
stack item from the register set 400 is stored in the location at the bank address '2' 
of the first bank BANK0, i.e., at the address '4 f of the stack storage 300. Finally, 
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the main stack pointer 210 is updated by the content ('5') of the adder 221, so that 

the pointer 210 points to the next location at address '5* of stack storage 300. 

Next, during a two-word push operation (e.g., 16-bit push), the first 

multiplexer 222 also selects the content ( f 00100 ! ) of the main stack pointer 210 in 

response to the instruction decoding signal DCSA2. Both the second and third 

multiplexers 225, 226 are enabled independent of the LSB MUX1<0> (which is 

output from the first multiplexer 222) because of the two- word operation. The 

second and third multiplexers 225, 226 both select the MUX1<4:1> ('0010') in 

response to the respective select control signals SA1 and SA2, so that the bank 

stack pointers 230 and 240 each contain the same bank address of '2\ 

Accordingly, a new two-word stack item from the register set 400 is stored in the 

locations at the bank addresses '2s' of the respective banks BANK0 and BANK1, 

i.e., at the addresses f 4 f and '5 ! of the stack storage 300. In this operation, the 

adder 221 adds '+2 1 to the content ('4') of the main stack pointer 210 and the main 

stack pointer 210 is updated by the content of the adder 221, allowing the pointer 

210*peints*to the next location at address ! 6' of stack storage 300 as shown in Fig. 
A 

4C. 

Next, with the one-word pop operation, the adder 221 adds f -r to the 
content ('4') of the main stack pointer 210. The first multiplexer 222 selects the 
content ( f 0001 1') of the adder 221 in response to the instruction decoding signal 
DCSA2. Since the MUX1<0> is logic T, the second multiplexer 225 is enabled 
while the third multiplexer 226 is disabled. The second multiplexer 225 selects 
the MUX1<4:1> of '000 T in response to the select control signal SA1 from the 
first MUX control logic circuit 224. As a result, the second bank stack pointer 
240 contains a bank address of T. Accordingly, the one- word item is removed 
from the location at the bank address T of the second bank BANK1, i.e., at the 
address '3' of the stack storage 300. Finally, the main stack pointer 210 is updated 
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by the content ('3') of the adder 221, so that the pointer 210 points to the top 
location at address '3' of stack storage 300 as illustrated in Fig. 4D. 

Next, during a two-word pop operation, the adder 221 adds '-2 1 to the 
content ( f 4 f ) of the main stack pointer 210, so that the adder 221 contains '2\ The 
5 first multiplexer 222 selects the content ('00010') of the adder 221 in response to 
the instruction decoding signal DCSA2. In this operation, both the second and 
third multiplexers 225, 226 are enabled. The second and third multiplexers 225, 
226 each select MUX1<4:1> of '0001' from the first multiplexer 222 in response 
to the respective select control signals SA1 and SA2, so that the bank stack 
10 pointers 230 and 240 each contain the same bank address of T. Accordingly, the 
two-word item is deleted from the locations at the bank addresses Ts' of the 
J respective banks BANK0 and BANK1, i.e., at the addresses '2 f and '3' of the stack 

^ storage 300. The main stack pointer 210 is also updated by the content ('2') of the 

adder 221, allowing the pointer 210 to point to the top location at address '2' of 

^ 15 stack storage 300 as shown in Fig. 4E. 

W 

b Fig. 5 A illustrates the situation where the main stack pointer 210 points to 

SI the location at an odd address f 5' (= '00101 1 ) of the stack storage 300, i.e., a bank 

J{j address '2' (= '0010') of the second bank BANK1 . Figs. 5B through 5E illustrate 

, Jr. 

*\ results of a one-word push, a two-word push, a one-word pop, and a two-word 

1st? 

20 pop operation, respectively, based on the example shown in Fig. 5A. Each of 
these stack operations will now be described in further detail. 

To begin, during a one-word push operation, the first multiplexer 222 
selects the content ( ! 00101') of the main stack pointer 210 in response to the 
instruction decoding signal DCSA2. The second and third multiplexer 225 and 

25 226 are enabled and disabled, respectively, because the MUX1<0> is T. The 

second multiplexer 225 selects the MUX1<4:1> of '0010' in response to the select 
control signal SA1 from the first MUX control logic circuit 224. As a result, the 
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second bank stack pointer 240 contains a bank address of '2'. Accordingly, a new 
one-word item is stored in the location at the bank address '2' of the second bank 
BANK1, i.e., at the address '5 f of the stack storage 300. The adder 221 adds '+1' 
to the content ('5') of the main stack pointer 210, and the main stack pointer 210 is 
updated by the content ('6 1 ) of the adder 221, so that the pointer 210 points to the 
next location at address '6' of stack storage 300 as illustrated in Fig. 5B. 

Next, during a two-word push operation, the first multiplexer 222 selects 
the content ('001 01' = 5) of the main stack pointer 210 in response to the 
instruction decoding signal DCSA2. The second and third multiplexers 225, 226 
are both enabled because of two-word operation. The increment logic circuit 223 
increases the MUX1<4:1> of '0010' by one. The second multiplexer 225 selects 
the MUX1<4:1> of f 2' in response to the select control signal SA1 from the first 
MUX control logic circuit 224 and the third multiplexer 226 selects the content 
('001 T) of the increment logic circuit 223 in response to the select control signal 
SA2 from the second MUX control logic circuit 227. As a result, the first bank 
stack pointer 230 contains a bank address of '001 V (= 3) and the second bank 
stack pointer 240 contains a bank address of '0010' (= 2). Accordingly, a new 
two-word item is inserted into the locations at the bank addresses '2 1 and '3 1 of the 
respective banks BANK1 and BANK0, i.e., at the addresses '5 1 and ! 6' of the stack 
storage 300. In this operation, the adder 221 adds '+2' to the content ('5 f ) of the 
main stack pointer 210 and the main stack pointer 210 is updated by the content of 
the adder 221, allowing the pointer 21 0-poiflts-to the next location at address 7' of 
stack storage 300 as shown in Fig. 5C. 

Next, during a one-word pop operation, the adder 221 adds 1 1 to the 
content ('5') of the main stack pointer 210. The first multiplexer 222 selects the 
content ( f 00100 f = 4) of the adder 221 in response to the instruction decoding 
signal DCSA2. Since the MUXIO is '0', the third multiplexer 226 is enabled 
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while the second multiplexer 225 is disabled. The third multiplexer 226 selects 
the MUX1<4:1> of '00 10' in response to the select control signal SA1. As a 
result, the first bank stack pointer 230 contains a bank address of '2\ Accordingly, 
the one-word item is removed from the location at the bank address '2' of the first 
bank BANKO, i.e., at the address '4' of the stack storage 300. Thereafter, the main 
stack pointer 210 is updated by the content ('4 ? ) of the adder 221, so that the 
pointer 210 points to the top location at address '3' of stack storage 300 as 
illustrated in Fig. 5D. 

Next, during a two-word pop operation, the adder 22 1 adds -2 ? to the 
content ('5') of the main stack pointer 210 in response to the instruction decoding 
signal DCSA1 and contains f 3\ The first multiplexer 222 selects the content 
( f 0001 T = 3) of the adder 221 in response to the instruction decoding signal 
DCSA2. The increment logic circuit 223 increases the MUX1<4:1> of '0001' by 
one. The second and third multiplexers 225, 226 are both enabled in this 
operation. The second multiplexer 225 selects the MUX1<4:1> in response to the 
select control signal SA1 and the third multiplexer 226 selects the content ('0010') 
of the increment logic circuit 223 in response to the select control signal SA2. As 
a result, the first bank stack pointer 230 contains a bank address of '2' and the 
second bank stack pointer 240 contains a bank address of T. Accordingly, the 
two-word item is removed from the locations at the bank addresses '2' and T of 
the respective banks BANKO and BANK1, i.e., at the addresses '4' and f 3 f of the 
stack storage 300. Finally, the main stack pointer 210 is updated by the content 
('3') of the adder 221, allowing the pointer 210 to point to the top location at 
address '3 1 of stack storage 300 as shown in Fig. 5E. 

Fig. 6 illustrates a detailed circuit architecture of a stack pointer circuit 200 
of Fig. 2 according to another embodiment of the invention. Referring to Fig. 6, 
the stack pointer circuit 200 comprises a stack pointer control logic circuit 220b 
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for controlling the stack pointers 210, 230 and 240, which includes an adder 251, 
three 2x1 multiplexers 252, 253 and 254, and a control logic circuit 255. 

The high-order bits MSP<n:l> of the main stack pointer 210 are applied to 
the adder 221. The adder 221 adds f +l f or to the MSP<n:l> in response to a 
instruction decoding signal DCSB1 from the instruction decoder 100, which 
indicates the types of stack-based instructions. The least significant bit MSP<0> 
is applied to the control logic circuit 255, along with an instruction decoding 
signal DCSB2 from the instruction decoder 100, which indicates the types of 
stack-based instructions. The control logic circuit 255 controls the multiplexers 
252-254 and the least significant bit MSP<0> of the main stack pointer 210 in 
response to the MSP<0> and the instruction decoding signal DCSB2. The 
multiplexers 253 and 254 are exclusively enabled by MUX enable signals EB1 
and EB2 from the control logic circuit 255, respectively, during one- word 
operations, and are both enabled during two-word operations. The MSP<n:l> is 
commonly applied to first inputs INI of the multiplexers 252-254 and the content 
ADD<n-l :0> of the adder 25 1 is applied to second inputs IN2 thereof. The 
output MUXl<n-l :0> of the first multiplexer 252 is fed back to the main stack 
pointer 210. The output MUX2<n-l :0> of the multiplexer 253 is provided to the 
second bank stack pointer 240. The output MUX3<n-l :0> of the multiplexer 254 
is provided to the first bank stack pointer 230. The respective multiplexers 252- 
254 selectively input one of the MSP<n:l> and the ADD<n-l :0> in response to 
select control signals SB1, SB2 and SB3 from the control logic circuit 255, 
respectively. The control logic circuit 255 generates a control signal SB4 for 
toggling the MSP<0>. 

Fig. 7A is a table which illustrates content variations of stack pointers of 
Fig. 6 in accordance with the stack instructions in the situation wherein the main 
stack pointer contains an even address. Fig. 7B is a table which illustrates content 
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variations of stack pointers of Fig. 6 in accordance with the stack instructions in 
the situation wherein the main stack pointer contains an odd address. 

Various examples of stack operations of the stack pointer circuit 200 of 
Fig. 6 will now be described with reference to Figs. 4A-5E, 6, 7A, and 7B. For 
5 the purposes of explanation, it is assumed that the main stack pointer 210 contains 
5 bits of MSP<4:0> and each of the bank stack pointers 230 and 240 is 4-bit wide. 

Referring initially to Figs. 4A, 6 and 7 A, a one-word push (e.g., 8-bit push) 
operation will now be described based on the above assumption that the main 
stack pointer 210 points to the location at the even address of '4' (= 00100 in 
10 binary). With this stack operation, the multiplexer 254 selects the MSP<4:1> of 
'00 10 1 (= 2) in response to the MUX enable signal EB2 and the select control 
^ signal SB3 from the control logic circuit 255, whereas the multiplexer 253 is 

W disabled by the MUX enable signal EB1 from the control logic circuit 255, so that 

W 

W the first bank stack pointer 230 contains '2'. Accordingly, a new one-word item is 

s|lts 

SI 15 stored in the location at the bank address '2' of the first bank BANK0, i.e., at the 

p - 3 

b address '4' of the stack storage 300. At this time, the multiplexer 252 selects the 

.ISlSS. 

iL,| 

ff% MSP<4: 1> in response to the select control signal SB1 from the control logic 

~J circuit 255, and then outputs MUX1<3:0> of '2' which is fed back to the main 

stack pointer 210. The control logic circuit 255 then generates control signal SB4 
20 to toggle the MSP<0> of logic f 0 ! (refer to Fig. 7A), so that the main stack pointer 
contains '00101' (= 5) as illustrated in Fig. 4B. 

Next, during a two-word push (e.g., 16-bit push) operation (assuming as in 
the above example that the main stack pointer 210 contains the address '4'), the 
adder 251 adds '+1' to the MSP<4:1> of '2' in response to the instruction decoding 
25 signal DCSB1 from the instruction decoder 100. The multiplexers 253 and 254 
both are enabled in response to the MUX enable signals EB1 and EB2 from the 
control logic circuit 255, respectively, and they select the MSP<4:1> of '2', so that 
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the bank stack pointers 230 and 240 each contain the same bank address of '2'. 
Accordingly, a new two-word item is inserted into the locations at bank addresses 
'2s' of the respective banks BANKO and BANK1, i.e., at the addresses '4' and '5' of 
the stack storage 300. On the other hand, the multiplexer 252 selects the adder's 
content ADD<3:0> of '001 1* (= 3) in response to the select control signal SB1. 
The MUX1<3:0> or ADD<3:0> is naturally provided to the high-order bits 
MSP<4:1> of the main stack pointer 210 with the least significant bit MSP<0> of 
logic '0', so that the main stack pointer 210 contains '001 10' (= 6) as illustrated in 
Fig. 4C. 

Next, during a one-word pop operation (assuming as in the above example 
that the main stack pointer 210 contains the address '4'), the adder 251 adds to 
the MSP<4:1> of '2' and the multiplexer 254 is disabled by the MUX enable 
signal EB2. The multiplexers 252 and 253 select the content ADD<3:0> of T 
from the adder 251 in response to the select control signals SB1 and SB2, 
respectively. Accordingly, the bank stack pointer 240 points to the bank address 
of T of the BANK1, i.e., at the address '3' of the stack storage 300, from which 
the top one-word item is deleted. The MUX1<3:0> of T is provided to the 
MSP<4:1> of the main stack pointer 210 and the MSP<0> is toggled to T in 
response to the control signal SB4 from the control logic circuit 255 (refer to Fig. 
7A), so that the main stack pointer 210 contains '0001 V (= 3) as illustrated in Fig. 
4D. 

Next, during two-word pop operation (assuming as in the above example 
that the main stack pointer 210 contains the address f 4'), the adder 251 adds -1' to 
the MSP<4:1> of '2' in response to the instruction decoding signal DSCB1, and 
the multiplexer 252 selects the ADD<3:0> of T in response to the select control 
signal SB1. The multiplexers 253 and 254 select the ADD<3:0> of T in response 
to the select control signal SB2 and SB3, respectively. As a result, the first and 
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second bank stack pointers 230 and 240 each contain T. Accordingly, the two- 
word item is removed from the locations at the bank addresses 'Is' of the 
respective banks BANKO and BANK1, i.e., at the address '2' and '3' of the stack 
storage 300. The MUX1<3:0> of T is provided to the MSP<4:1> of the main 
5 stack pointer 210, so that the main stack pointer 210 contains '00010' (= 2) as 
illustrated in Fig. 4E. 

Referring to Figs. 5A through 5E, 6 and 7B, various examples of stack 
operations will be described under the assumption that the main stack pointer 210 
points to the location at the odd address of '5' (= 00100) (as illustrated in Fig. 5 A). 
10 To begin, during a one- word push operation, the adder 25 1 adds T to the 

MSP<4:1> of '2' in response to the instruction decoding signal DCSB1, and the 

,s=tr : 

multiplexer 254 is disabled by the MUX enable signal EB2. The multiplexer 252 
selects the ADD<n-l :0> of '3' and outputs MUX1<3:0> in response to the select 
W control signal SB1 . In addition, the multiplexer 253 selects the MSP<4:1> of '2' 

HJ 15 in response to the select control signal SB2, so that the second bank stack pointer 

W 

, 240 contains '2\ Accordingly, a new one- word item is stored in the location at the 

m bank address '2 1 of the second bank BANK1, i.e., at the address '5' of the stack 

iff : 

{JJ storage 300. The MUX1 <3 :0> of '3' is fed back to the main stack pointer 2 1 0 and 

jjj the control logic circuit 255 controls the main stack pointer 210 to toggle the 

20 MSP<0> of ' 1 * (refer to Fig. 7B) with the control signal SB4, thereby causing the 
main stack pointer 210 to point to the location at the address '001 10' (= 6) of the 
stack storage 300 as illustrated in Fig. 5B. 

Next, during a two-word push operation (assuming as in the above example 
that the main stack pointer 2 1 0 contains '5'), the adder 25 1 adds '+ V to the 
25 MSP<4: 1 > of '2' in response to the instruction decoding signal DCSB 1 . The 

multiplexers 253 and 254 both are enabled in response to the MUX enable signals 
EB1 and EB2, respectively. The multiplexer 253 selects the MSP<4:1> of '2' in 
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response to the select control signal SB2, so that the second bank stack pointer 
240 contains the bank address of '2*. The multiplexer 254 selects the ADD<3:0> 
of '3' in response to the select control signal SB2, so that the first bank stack 
pointer 230 contains ! 3\ Accordingly, a new two-word item is inserted into the 
locations at bank addresses '3 ! and '2 1 of the respective banks BANKO and 
BANK1, i.e., at the addresses , 6 ? and '5' of the stack storage 300. The multiplexer 
252 selects the adder's content ADD<3:0> of '3* in response to the select control 
signal SB1 . The MUX1<3:0> of '3' is provided to the high-order bits MSP<4:1> 
of the main stack pointer 210 with the least significant bit MSP<0> of T, so that 
the main stack pointer 210 contains '001 IT (= 7) as illustrated in Fig. 5C. 

Next, during a one-word pop operation (assuming as in the above example 
that the main stack pointer 210 contains '5'), the multiplexer 254 selects the 
MSP<4:1> of f 2 ! in response to the MUX enable signal EB2 and select control 
signal SB3 while the multiplexer 253 is disabled by the MUX enable signal EB1, 
so that the first bank stack pointer 230 contains '2\ Accordingly, the top one-word 
item is removed from location at the bank address '2' of the first bank BANKO, 
i.e., at the address '4' of the stack storage 300. In addition, the multiplexer 252 
selects the MSP<4:1> in response to the select control signal SB land outputs 
MUX1<3:0> of '2 1 , which is then fed back to the main stack pointer 210. The 
control logic circuit 255 generates the control signal SB4 to toggle the MSP<0> of 
T (refer to Fig. 7B), so that the main stack pointer contains '00100' (= 4) as 
illustrated in Fig. 5D. 

Next, during a two-word pop operation (assuming as in the above example 
that the main stack pointer 210 contains '5'), the adder 251 adds '-1' to the 
MSP<4:1> in response to the instruction decoding signal DSCB1 and the 
multiplexer 252 selects the ADD<n-l:0> of T in response to the select control 
signal SBL The multiplexer 253 selects the ADD <3:0> of T in response to the 
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select control signal SB2, whereas the multiplexer 254 selects the MSP<4:1> of f 2 f 
in response to the select control signal SB3. As a result, the first bank stack 
pointer 230 contains '2 f and the second stack pointer 240 contains T. 
Accordingly, the two-word item is removed from the locations at the bank 
5 addresses '2' and T of the respective banks BANKO and BANK1, i.e., at the 

address '4' and '3' of the stack storage 300. The MUX1<3:0> of T is provided to 
the MSP<4:1> of the main stack pointer 210, so that the main stack pointer 210 
contains '0001 1' (= 3) as illustrated in Fig. 5E. 

As described above, an embedded hardware stack having an architecture in 
10 accordance with the teachings herein can advantageously perform multi-word 

stack operations, as well one-word operations, thereby improving the performance 
of processors having such stack architectures. 
fi Although the present invention and its advantages have been described in 

detail, it should be understood that various changes, substitutions and alterations 
15 can be made herein without departing from the spirit and scope of the invention as 
* defined by the appended claims. 

' : ri z 
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